Learning to Ride a Bicycle using Iterated Phantom Induction

نویسندگان

  • Mark Brodie
  • Gerald DeJong
چکیده

We build upon our work on iterated phan. ... . .. . -.. . wm IDQUC1O10D anQ Ule worK 01 naDQltZ1V anQ Alstrem OD -applying reinforcemeni learning to the challenging control task of learning to ride a bicycle. Last year phantom induction was demonstrated on a nonlinear but non-dynamica1 task. The bicycle domain is a dynamical system whose next state is a function of the current state and control inputs.. Randl,v and AlstrfIIm demonstrated this task was learnable using Sarsa(~) with shaping. Our approach integrates domain knowledge into the learning process in a flexible way. By taking advantage of what a domain theory expert can easily express, our .system learns to ride. in a few dozen examples rather than the thousands required by the system of Randl,v and AIstr,m. We also show that the two inputs, shifting the rider's center of mass and deflecting the handlebars, are not needed simultaneously for learning. Our system learns to ride using either input alone. By extending phantom induction into the space of dynamical systems we demonstrate the applicability of our learning technique to an important and difficult class of problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

It Takes Two Neurons To Ride a Bicycle

Past attempts to get computers to ride bicycles have required an inordinate amount of learning time (1700 practice rides for a reinforcement learning approach [1], while still failing to be able to ride in a straight line), or have required an algebraic analysis of the exact equations of motion for the specific bicycle to be controlled [2, 3]. Mysteriously, humans do not need to do either of th...

متن کامل

Blitzograms - Interactive Histograms

You probably know how to ride a bicycle, but would you even recognize the equations of bicycle motion? The equations are clearly not necessary for riding a bike, but are they sufficient? Suppose, that upon seeing your first bicycle, you had derived the equations of motion from scratch, starting with F=ma. Would you shout “Eureka” then jump on and start riding? No, the equations of motion are ir...

متن کامل

Oats ( not just for breakfast anymore )

2 3 The bicycle without a rider balances perfectly well. With a novice rider, it will fall. This is because the novice has the wrong intuitions about balancing and freezes the position of the bicycle so that its own corrective mechanism cannot work freely. Thus learning to ride does not mean learning to balance, it means learning not to unbalance, learning not to interfere.

متن کامل

Bicycle Access to Public Transportation: Learning from Abroad

w bile the United States has been investing in costly park-and-ride systems that have made transit increasingly dependent on the automobile, European and Japanese communities have been strengthening the potential for people to walk and bicycle to and from transit, boosting ridership at a far lower cost. In Japan and much of Europe, the fastest-growing and often predominant access mode to suburb...

متن کامل

A New Method to Enhance the Low Voltage Ride Through Capability of Doubly Fed Induction Generator Using Resistive Type Superconducting Fault Current Limiter

In the event of voltage sag at the nearing of a wind park, severe currents may pass through the power electronic devices of the Doubly Fed Induction Generators (DFIGs). Hence, according to the Low Voltage Ride Through (LVRT) requirements, the wind parks are allowed to disconnect from the network after a certain time. Therefore, to avoid such actions, the LVRT requirement of wind parks is consid...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999